Everything about Lexical Similarity totally explained
In
linguistics,
lexical similarity is a measure of the degree to which the word sets of two given
languages are similar. A lexical similarity of 1 (or 100%) would mean a total overlap between vocabularies, whereas 0 means there are no common words.
There are different ways to define the lexical similarity and the results vary accordingly. For example,
Ethnologue's method of calculation consists in comparing a standardized set of wordlists and counting those forms that show similarity in both form and meaning. Using such a method,
English was evaluated to have a lexical similarity of 60% with
German and 27% with
French.
Lexical similarity can be used to evaluate the degree of
genetic relationship between two languages. Percentages higher than 85% usually indicate that the two languages being compared are likely to be related "
dialects".
The lexical similarity is only an indication of the
mutual intelligibility of the two languages, since the latter also depends on the degree of phonetical, morphological, and syntactical similarity. It is worth noting that the variations due to differing wordlists weigh on this- for example, lexical similarity between French and English is considerable in lexical fields relating to culture, etc., whereas their similarity is smaller as far as basic (function) words are concerned. Unlike mutual intelligibility, lexical similarity can only be symmetrical.
Indo-European languages
The table below shows some lexical similarity values for pairs of
Indo-European languages.
Lang. code |
Language 1 ↓ |
Lexical similarity coefficients |
| cat |
Catalan |
1 |
| eng |
English |
- |
1 |
| fra |
French |
- |
0.27 |
1 |
| deu |
German |
- |
0.60 |
0.29 |
1 |
| ita |
Italian |
0.87 |
- |
0.89 |
- |
1 |
| por |
Portuguese |
0.85 |
- |
0.75 |
- |
- |
1 |
| ron |
Romanian |
0.73 |
- |
0.75 |
- |
0.77 |
0.72 |
1 |
| roh |
Romansh |
0.76 |
- |
0.78 |
- |
0.78 |
0.74 |
0.72 |
1 |
| rus |
Russian |
- |
0.24 |
- |
- |
- |
- |
- |
- |
1 |
| srd |
Sardinian |
0.75 |
- |
0.80 |
- |
0.85 |
- |
- |
0.74 |
- |
1 |
| spa |
Spanish |
0.85 |
- |
0.75 |
- |
0.82 |
0.89 |
0.71 |
0.74 |
- |
0.76 |
1 |
| Language 2 → | cat |
eng |
fra |
deu |
ita |
por |
ron |
roh |
rus |
srd |
spa
|
Notes:
- Language codes are from standard ISO/DIS 639-3.
- Ethnologue doesn't specify for which Sardinian variety was the lexical similarity calculated.
Further Information
Get more info on 'Lexical Similarity'.
|
External Link Exchanges
Do you know how hard it is to get a link from a large encyclopaedia? Well we're different and will prove it. To get a link from us just add the following HTML to your site on a relevant page:
<a href="http://lexical_similarity.totallyexplained.com">Lexical similarity Totally Explained</a>
Then simply click through this link from your web page. Our crawlers will verify your link, extract the title of your web page and instantly add a link back to it. If you like you can remove the words Totally Explained and embed the link in article text.
As long as your link remains in place, we'll keep our link to you right here. Please play fair - our crawlers are watching. Your site must be closely related to this one's topic. Any kind of spamming, dubious practises or removing the link will result in your link from us being dropped and, potentially, your whole site being banned. |